04:00
2026-06-04
arxiv.org
artificial-intelligence
GroupToM-Bench: Benchmarking Group Theory of Mind and Nonlinear Social Emergence in MLLMs
Multimodal large language models fail to infer how individual mental states interact and crystallize into group-level outcomes, according to a new benchmark called GroupToM-Bench. The benchmark, the f…